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Mechanical Translation 

New Challenge to Communication 

Jacob Ornstein 


One of the thorniest problems in pres- 
ent-day communication is the translation 
of important writings, especially in the 
fields of science and technology, from one 
language to another. The machine age 
has done little to alter the translation 
process, which today, as it was centuries 
ago, must be laboriously performed at 
considerable expense by human beings. 
Moreover, grqwing concern over the dan- 
gerous lag bjetween the appearance of 
important wo)rks in other languages and 
their translation into English and other 
tongues has Within the past decade fo- 
cused serious; attention on the problem 
of mechanical translation. Until recently, 
however, mechanical translation has re- 
mained one o!f those human dreams the 
realization of which was relegated to 
some point ill the unpredictable future. 

It was with the development of the 
electronic computer that real hope came 
to be felt concerning the possibility of 
mechanical translation. The man who 
first envisioned the possibility of transla- 
tion by electronic means was apparently 
Warren Weaver, director of the Natural 
Sciences Division of the Rockefeller 
Foundation. First of all in 1945, and 
more concretely in a memorandum dated 
15 July 1949, Weaver raised the question 
of the feasibility of designing a computer- 
like machine capable of translating from 
one language to another. This memoran- 
dum was circulated among a number of 


linguists and scientists; their reactions 
ranged from high optimism to complete 
skepticism. Weaver's concepts, neverthe- 
less, aroused considerable interest and 
stimulated preliminary research in the 
field. 

The problem of mechanical transla- 
tion is being approached by a growing 
number of scholars, although much of 
the research is in the realm of speculation 
and theory. Among mathematicians, one 
may mention such figures as Ychoshua 
Bar-Hillel, formerly of Massachusetts 
Institute of Technology and now of 
Hebrew University, whose articles on 
mechanical translation were accorded 
considerable space in the 5 April 1954 
issue of Time , the weekly news maga- 
zine. Important research is being carried 
on at Massachusetts Institute of Tech- 
nology by Victor H. Yngve of the Re- 
search Laboratory of Electronics and 
William N. Locke, professor of modem 
languages. Anthony Oettinger of Harvard 
University's Computation Laboratory 
has also been conducting experiments. 
Erwin Reifler of the University of Wash- 
ington has concerned himself with the 
elaboration of a mechanical dictionary. 
An active group, headed by William Bull 
and Victor Oswald, is working on prob- 
lems of mechanical translation at the 
University of California at Los Angeles. 

The first successful experiment in per- 
forming mechanical translation was the 


result of a joint project undertaken by 
Georgetown University's Institute of 
Languages and Linguistics, Washington, 
D.C., and the International Business Ma- 
chines Corporation. Considerable public- 
ity followed their demonstration at 
I.B.M. headquarters in New York on 
7 January 1954 of the translation of more 
than 60 sentences from Russian into Eng- 
lish. An invited group of government offi- 
cials, linguists, and scientists watched a 
typist who knew no Russian type the 
sentences, which had been transliterated 
into the Roman alphabet, on an electric 
card punch and feed them into the “elec- 
tronic translator,” which produced ac- 
curate English translations. The sen- 
tences were from the workaday fields of 
science, technology, communications, 
and international affairs. The following 
are a few examples of the transliterated 
Russian sentences and the English equiv- 
alents: 

Myezhdunarodnoye ponyimanyiye yav- 
lyayetsya vazhnim faktorom v ryeshyen- 
yiyi polyityichyeskyix voprosov. 

International understanding consti- 
tutes an important factor in decision of 
political questions. 

Dorogi stroyatsya yiz byetona. 

Roads are constructed from concrete. 

Komandyir poluchayet svyedyenyiya 
po tyelyegrafu. 

A commander gets information over a 
telegraph. 

Vyelyichyina ugla opryedyelyayetsya 
otnoshyenyiyem dlyiny dugi k radyiusu. 

Magnitude of angle is determined by 
the relation of length of arc to radius. 

Obrabotka povyshayet kachyestvo 
nyeftyi. 

Processing improves the quality of 
crude oil. 

It is revealing to consider in some de- 
tail the background of the Georgetown- 
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Fig. 1. Dictionary syntax flow chart showing the logical procedure followed by 
701 electronic data-processing machine in translating from one language to another. 


I.B.M. experiment, its accomplishments, 
and the implications that it has for the 
future. Among other things, it is the ac- 
count of an experience in which the hu- 
manities, represented by languages, 
joined forces with science and technology 
to solve a problem in human communi- 
cation. 

Early in 1952, Leon Dostert, director 
of the Georgetown University Institute of 
Language and Linguistics, was invited 


participate in the first Conference on 
Mechanical Translation, which was held 
with the support of the Rockefeller 
Foundation at Massachusetts Institute of 
Technology. Highly skeptical of the fea- 
sibility of any such process, Dostert pre- 
pared a talk expressing his grave doubts. 
However, on his way to Cambridge, he 
began to question his own skepticism and 
destroyed his script. Arriving in a more 
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proceedings of the conference to believe 
that nothing would ever be accomplished 
unless the matter went beyond the realm 
of mere speculation. 

Armed with pertinent linguistic data, 
Dostert approached the International 
Business Machines Corporation, where 
he was accorded a sympathetic recep- 
tion. Thomas J. Watson, chairman of 
the board, authorized and encouraged the 
project, which was conceived in terms of 
a trial run of a glossary of 250 Russian 
words. Cuthbert C. Hurd, now I.B.M.’s 
director of electronic data-processing 
machines, and Peter Sheridan, mathe- 
matician, concerned themselves with the 
technical and scientific aspects of the un- 
dertaking. 

On the linguistic side, it was Dostert 
who was the moving force behind the 
experiment. He largely delegated the 
problems of linguistic analysis to Paul 
Garvin, anthropologist and linguist who 
is acquainted with a wide range of Euro- 
pean and other tongues. Two years of 
close collaboration followed in which the 
most recent findings of linguistics and of 
mathematical logic were harnessed to 
find a solution to the problem. 

The demonstration that took place in 
January 1954 represented the successful 
completion of the first phase of the joint 
experiment as well as tangible proof that 
machine translation is possible. The en- 
thusiasm of the publicity surrounding 
the demonstration tended to create the 
impression that the problems of auto- 
matic translation had largely been solved. 
This docs not correspond to the reality 
of the situation. Much still remains to be 
done. Dostert, wishing to curb the tend- 
ency to describe the results of the dem- 
onstration in excessively glowing terms, 
has repeatedly referred to it as the 
“Kitty Hawk” of the experiment. 

At the present stage of development, 
the translation machine actually consists 
of an I.B.M. data-processing machine 
type 701, which is known popularly as 
the “mechanical brain” and which is 
capable of performing numerous arith- 
metical and logical operations at very 
great speeds. To the layman, it looks 
like an assortment of 11 complicated 
electronic units, not unlike modern 
kitchen ranges, connected by cables to 
function as a unit. 

The translator has a vocabulary of 250 
words. To prepare it for the experiment, 
each word was punched on a punch 
card, together with its English equivalent 
or equivalents and three codes that were 
designated first, second, and third. The 
information on the punch cards was 
stored in the form of plus and minus 
charges on the magnetic drums. This 
occupied the space of 6000 “machine 
words” of 36 binary digits each. Follow- 
ing this, the programs developed for 
nto the machine. 
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Each instruction written in ordinary 
English had to be converted into terms 
of detailed programs for the individual 
computations of the machine. A single 
sentence of verbal instructions might re- 
quire 100 computations on the part of 
the machine. The program steps for the 
machine, prepared by Sheridan, totaled 
about 2400. The dictionary syntax flow 
chart shown in Fig. 1 gives an idea of 
what the programming represents. 

The designers also elaborated three 
codes that indicated to the machine 
which of the six rules of operational syn- 
tax applies to each word inserted for 
translation. These rules, or rule-tags, 
which were developed by Garvin, gov- 
ern the transposition of words, choice of 
meanings where there are several alterna- 
tives, omission of unnecessary words, and 
insertion of necessary words. Their role, 
therefore, is to order the units so that the 
words will not come out a mere jumble. 

Let us take a few examples of the 
functioning of the rules. The Russian 
words gyeneral mayor indicate a rank 
roughly equivalent to major general. Ob- 
viously the two words must be reversed. 
The switch is assured in advance by at- 
taching the rule-tag 21 both to the Rus- 
sian word gyeneral and to its translation 
in the bilingual glossary stored in the 
machine, and by attaching rule-tag 110 
to the Russian word mayor and its trans- 
lation. According to the stored instruc- 
tions, whenever a rule-tag 110 is encoun- 
tered in the glossary, it is necessary to go 
back and look for a rule-tag 21. If a 21 
is found, the two translations must be 
printed in reverse order. Thus the trans- 
lation major of mayor is printed before 
the translation general of gyeneral. 

One more illustration is worth noting. 
The Russian word o can mean either 
about or of. In the Russian-English glos- 
sary nauka has affixed to it the rule-tag 
242 and o carries the rule-tag 141. The 
instructions indicate to the machine that 
whenever rule-tag 141 is encountered, it 
is necessary to go back and search for 
241 or 242. If 241 is found, the first Eng- 
lish translation is selected and both words 
are printed in the order in which they 
appear in the Russian sentence. If 242 
is encountered, the second English mean- 
ing is selected. Consequently, the com- 
puter reads the 141, looks for and finds 
242, chooses the second meaning given 
for o y which is of, and prints correctly 
science of. 

Table 1 contains the three codes and 
the six Garvin rules of syntax; Fig. 2 il- 
lustrates how a source sentence is trans- 
lated, analyzed, and arranged in correct 
English word order by the converted 


been chosen. One reason for the selection 
of Russian was its strategic importance; 
another was the fact that there is a rela- 
tively small number of persons competent 
to handle Russian, while the accumula- 
tion of untranslated works in that tongue 
continues to increase at an alarming rate. 
This accumulation constitutes an over- 
flowing reservoir of data about the Soviet 
Union — books, newspapers, and journals 
available in Russian to any in [crested 
party. Moreover, the Georgetown lin- 
guists felt that in view of the highly in- 
flected nature of the language, which is 
similar to Greek and Latin in its multi- 
plicity of endings, a mechanical trans- 
lator capable of handling Russian would 
yield significant experience for dealing 
with other complicated languages. 

A preponderance of sentences from 
scientific and technical fields was chosen 
for the trial run, not only because of ob- 
vious timeliness, but also because writing 
in scientific and technical fields is done 
with words having highly specialized and 
precise meanings. Ilcncc, if a word ap- 
pears in a certain context, the chances 
of its having a single, unambiguous mean- 
ing arc extremely high. The same possi- 
bilities for accurate prediction occur in 
other fields of technical writing such as 
medicine and engineering. Consequently, 
the Georgetown linguists assume that 
electronic translation will begin with 
separate dictionaries for each technical 
area; as experience with these areas 
grows, enough will be learned eventually 
to permit accurate translation of our 
common everyday language as well. 

Two major problems exist for design- 
ers of mechanical translators. The first is 
that of reducing the number of synonyms 
of any given word for storage in the me- 
chanical dictionary. This is rendered dif- 
ficult by the fact that compilers of con- 
ventional dictionaries have persisted in 
giving an unnecessarily large number of 
equivalents for each entry. The experi- 
ence with the electronic translator has 
indicated that improved selective tech- 
niques based on the principles of pres- 
ent-day linguistics will help provide 
answers to this problem. 

The second difficulty is that of “opera- 
tional syntax,” which in mechanical 
translation parlance means the obtaining 
of an intelligible output sequence from 
an input with a different sequence of 
elements. This can be solved only by pre- 
paring detailed linguistic instructions 
that in turn must be converted into elec- 
tronic instructions to the computer. Onl^ 
by such means can machines produce inj- 
telligent translations relatively free of 
ambiguities. 
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Fig. 2. Diagram illustrating how a source 
sentence is translated, analyzed, and ar- 
ranged into the target sentence in correct 
English word order by the I.B.M. 701 
electronic data-processing machine. See 
Table 1 for rules of operational syntax. 


Table 1. Rules of operational syntax to 
accompany Fig. 2 

Rule 1 : Rearrangement. If first code is 
110, is third code associated with preced- 
ing complete word equal to 21? If so, 
reverse order of appearance of words in 
output (that is, word carrying 21 should 
follow that carrying 110); otherwise, re- 
tain order. In both cases, English equiva- 
lent I associated with 110 is adopted. 

Rule 2: Choice — Following Text.. If 
first code is 121, is second code of the fol- 
lowing complete, subdivided, or partial 
(root or cncfing) word equal to 221 or 
222? If it is 221, adopt English equivalent 
I of word carrying 1 2 1 ; if it is 222, adopt 
English equivalent II. In both cases, 
retain order of appearance of output 
words. 

Rule 3: Choice — Rearrangement. If 
first code is 131, is third code of preceding 
complete word or either portion (root or 
ending) of preceding subdivided word 
equal to 23? If so, adopt English equiva- 
lent II of word carrying 131 and retain 
order of appearance of words in output; 
if not, adopt English equivalent I and re- 
verse order of appearance of words in 
output. 

Rule 4: Choice-Previous Text. If first 
code is 141, is second code of preceding 
complete word or either portion (root or 
ending) of preceding subdivided word 
equal to 241 or 242? If it is 241, adopt 
English equivalent I of word carrying 141 ; 
if it is 242, adopt English equivalent II. 
In both cases, retain order of appearance 
of words in output. 

Rule 5: Choice — Omission. If first code 
is 151, is third code of following complete 
word or either portion (root or ending) 
of following subdivided word equal to 25? 
If so, adopt English equivalent II of word 
carrying 151 ; if not, adopt English equiva- 
lent I. In both cases, retain order of ap- 
pearance of words in output. 

Rule 6: Subdivision. If first code asso- 


I.B.M. 701 electronic data-processing The converted 701 computer with its dated with a Russian dictionary word is 

machine. 250-word vocabulary and six Garvin ***> th f n adopt English equivalent I of 

The Russian language was chosen by rules of operational syntax is still too alternative English language equivalents, 

the designers, but any one of the 2000- limited to be effective for full-scale trans- retaining order of appearance of output 

odd languages of the world might have lation. This means that designers must wit respect to previous wor 
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embark upon a new phase of develop- 
ment, seeking both to amplify the vo- 
cabulary and to increase the ability of 
the machine to handle a wider range of 
operational syntax. The six; rules will 
have to be vastly increased in number. 
Dostert estimates that about 100 rules 
would be needed to govern a vocabulary 
of 20,000 words. All this will require a 
great deal of additional linguistic re- 
search, as well as a special study, from 
the mechanical-translation point of view, 
of the source language vis-a-vis the target 
language. From the technical standpoint, 
this will involve increasing the machine 
storage and possibly the devising of spe- 
cial circuits. It will be necessary to ana- 
lyze translation samples of increasing 
length and complexity and to adjust the 
results progressively after each analysis. 
The validity of the programming must 
continually be tested against machine 
equipment. 

Looking realistically at the electronic 
translator, its developers recognize that 
the I.B.M. 701, which costs about $500,- 
000, is “overdesigned” for language 
translation; it has many functions not es- 
sential to this task that were built in to 
solve problems in astronomy and physics. 
The bulkiness of the 1 1 units, which oc- 
cupy roughly the same area as a tennis 
court, is another drawback. According to 
Hurd, I.B.M. is considering the develop- 
ment of a machine exclusively intended 


for translation. However, this cannot be 
accomplished until the Georgetown lan- 
guage specialists have elabor ated addi 
tional instructions and sp •cificatiom 
based on the experience tha has been 
gained in the first phase ol the joim 
project. 

The time is still distant when it will b<- 
possible to insert a Russian scientific book 
into a translating machine anc to receive 
from it an acceptable translation. It is 
highly doubtful that any machine could 
ever be devised that is capable of per 
forming satisfactory translatio is of works 
of fiction or writings in which fine shades 
of meaning and subjective intc rpretations 
are involved. The translation of such a 
work as Milton’s Paradise Last or Tol- 
stoy’s War and Peace is in it: elf a crea- 
tive act. Nor would mechani al devices 
ever eliminate the need for professional 
human translators. On the contrary, the 
latter would be freed from d ill, routine 
hackwork and could devote themselves 
to the translation of works of literary and 
artistic merit that are more hallcnging 
in nature. 

Neil MacDonald, describing tin- 
George town- I.B.M. project ir his article 
“Language translation by machine," 
which was published in th ■ February 
1954 issue of Computers an l Automa- 
tion , pointed out that the search for tin* 
solution of the translation problem 
brought to light many new fa< ts that will 


tend to bridge the gap between the hu- 
manities and science. For example, it was 
found that the formulation of the logic 
required to convert word meanings prop- 
erly, even in a small segment of two lan- 
guages, necessitated as many instructions 
to the computer as are required to simu- 
late the flight of a guided missile. He 
predicted that in the future 

“Linguists will be able to study a lan- 
guage in the way that a physicist studies 
material in physics, with very few human 
prejudices and preconceptions, because 
the language has to be reduced to its 
operational characteristics in order to be 
handled electronically.” All this suggests 
that a new discipline may emerge in 
which science and linguistics combine to 
solve international problems of human 
communication. 

It may not be too visionary to suggest 
that with the perfecting of mechanical 
translation, significant writings of one 
land will be made available to interested 
readers in other countries shortly after 
they emerge from the presses. Thus, un- 
derdeveloped areas of the globe could re- 
ceive the benefits of advanced science, 
technology, and knowledge at a rate im- 
possible today. The general problem of 
the lag in the translation of key works 
from one language to another will cer- 
tainly be remedied proportionally as ma- 
chine translation becomes better devel- 
oped. 
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